OM nucleic - nucleic search, using sw model



Run on:         January 13, 2007, 13:59:11 ; Search time 1015 Seconds
                (without alignments)
                10269.469 Million cell updates/sec

Title:          US-10-776-767-1
Perfect score:  1495
Sequence:       1 AAATATTACGCTGGTTGCAT...TATCCGAAAGTTCGGCTGGA 1495



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 
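The "sw model" with an opening and an extension penalty corresponds to Smith-Waterman local alignment with affine gaps. The sketch below is a minimal, score-only version of that recurrence (Gotoh's formulation), not the search engine's actual implementation. Two things in it are assumptions rather than statements from this report: a gap of length k is charged Gapop + k * Gapext, and the mismatch score is -0.6, a value chosen because it reproduces the scores listed in the results.

```python
def sw_affine_score(query, target, match=1.0, mismatch=-0.6,
                    gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (score only, no traceback).

    match/mismatch are assumed values; gap_open/gap_ext mirror Gapop/Gapext.
    """
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)      # best score ending at row i-1, column j
    e_prev = [neg_inf] * (n + 1)  # best score ending in a gap that skips query bases
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        e_cur = [neg_inf] * (n + 1)
        f = neg_inf               # best score ending in a gap that skips target bases
        for j in range(1, n + 1):
            # Extend an existing gap, or open a new one from the last aligned cell.
            e_cur[j] = max(e_prev[j] - gap_ext, h_prev[j] - gap_open - gap_ext)
            f = max(f - gap_ext, h_cur[j - 1] - gap_open - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e_cur[j], f)
            best = max(best, h_cur[j])
        h_prev, e_prev = h_cur, e_cur
    return best


print(sw_affine_score("ACGT", "ACGT"))
```

Each inner-loop iteration is one "cell update", which is the unit behind the throughput figure in the header.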



Searched:       5244920 seqs, 3486124231 residues

Total number of hits satisfying chosen parameters:      10489840

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
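The throughput and hit count in the header can be cross-checked from the other reported figures. The short calculation below assumes that every database sequence is compared on both strands (forward and reverse complement); that factor of 2 is an inference from the numbers, not something the report states.

```python
query_length = 1495            # "Perfect score" / query length from the header
db_residues = 3_486_124_231    # residues searched
db_sequences = 5_244_920       # sequences searched
search_seconds = 1015          # search time without alignments
strands = 2                    # assumption: both strands are searched

cell_updates = query_length * db_residues * strands
million_cells_per_sec = cell_updates / search_seconds / 1e6
total_hits = db_sequences * strands

print(f"{million_cells_per_sec:.3f} Million cell updates/sec")  # 10269.469
print(total_hits)                                               # 10489840
```

Both values agree with the header, which supports the two-strand reading.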



Database :      N_Geneseq_8:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
                10: geneseqn2003cs:*
                11: geneseqn2003ds:*
                12: geneseqn2004as:*
                13: geneseqn2004bs:*
                14: geneseqn2005s:*
                15: geneseqn2006s:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

RESULT 1
AAV34335

ID AAV34335 Standard; DNA; 1495 BP.
XX
AC AAV34335;
XX
DT 27-AUG-2003 (revised)
DT 01-MAR-1999 (first entry)
XX
DE Burkholderia casidae 16S rRNA gene.
XX
KW Burkholderia casidae; 16S rRNA gene; biological control; antimicrobial;
KW ss.
XX
OS Burkholderia.
XX
PN WO9848276-A1.
XX
PD 29-OCT-1998.
XX
PF 23-APR-1998; 98WO-US008148.
XX
PR 23-APR-1997; 97US-0044532P.
PR 22-APR-1998; 98US-00063898.
XX
PA (PENN-) PENN STATE RES FOUND.
PA (VIRG ) VIRGINIA TECH INTELLECTUAL PROPERTIES.
XX
PI Casida LE, Falkinham JO, Cain CC;
XX
DR WPI; 1998-610004/51.
XX
PT New isolated Burkholderia casidae bacterium - used for producing anti-
PT microbial preparations active against bacteria, yeast, filamentous fungi,
PT protozoa and algae.
XX
PS Claim 67; Fig 1; 88pp; English.
XX

CC This is the nucleotide sequence of the 16S rRNA gene of Burkholderia
CC casidae (Bc), a novel predator bacterium that has biocontrol activity
CC against a broad range of microorganisms, particularly microbial
CC pathogens. Bc appears to have 5 16S rRNA operons. The DNA sequenced was a
CC PCR product of genomic DNA from Bc strain 2.2N (ATCC 55961), generated by
CC primers 27f (see AAV34336) and 1522r (see AAV34338). A claimed new pure
CC culture or suspension of Bc or its variant which exhibits biocontrol
CC activity has: (a) a 16S rRNA gene comprising a sequence that is at least
CC 97% similar to the 1495 bp sequence, and (b) a stated cellular fatty acid
CC composition. Also claimed are: (1) a cell-free filtrate or cell fraction
CC prepared from a pure culture or suspension as above; (2) an antimicrobial
CC preparation comprising an alcohol extract of a cell, cyst, culture,
CC suspension, cell-free filtrate or cell fraction of Bc; (3) a method for
CC producing an antimicrobial preparation from Bc; and (4) a method for
CC isolating Bc or variant on the basis of 16S rRNA sequence and cellular
CC fatty acid composition. The pure cultures or suspensions can be used as
CC biocontrol compositions for treating or preventing a disease of a plant
CC (claimed) such as a disease caused by Alternaria, Aspergillus, Botrytis,
CC Cercospora, Cercosporidium, Geotrichum, Mycosphaerella, Mucor,
CC Penicillium, Phoma, Phytophthora, Plasmopora, Pseudopeziza, Puccinia,
CC Pythium, Rhizoctonia, Rhizopus, Saccharomyces, Septoria, Sporothrix,
CC Stemphylium, Trichophyton, Verticillium, Erwinia, Pseudomonas or
CC Xanthomonas (claimed). The plants may be e.g. safflower, cotton, flax,
CC oat, canola, poinsettia, chrysanthemum, corn, soybean, wheat, rice,
CC alfalfa, sorghum, peanut, tobacco, tomato, pepper, cucumber, lettuce,
CC green bean, lima bean, pea, cantaloupe, musk melon, citrus fruit, grape,
CC banana, geranium, azalea, rose, tulip, petunia, orchid, carnation, pine,
CC yew or spruce (claimed). The compositions can also be used in the
CC treatment of microbial diseases of animals, fish, humans and to control
CC protozoan and algal growth in aquatic environments. (Updated on 27-AUG-
CC 2003 to correct OS field.)
XX 

SQ Sequence 1495 BP; 379 A; 342 C; 473 G; 301 T; 0 U; 0 Other; 
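
Each database record uses two-letter line codes (ID, AC, DT, DE, KW, OS, PN, PD, PF, PR, PA, PI, DR, PT, PS, CC, SQ) separated by "XX" spacer lines. A minimal sketch of grouping such lines by code is shown below; `parse_record` is a hypothetical helper written for illustration, and it assumes the code is the first whitespace-delimited token on the line.

```python
def parse_record(lines):
    """Group record lines by their two-letter code; 'XX' spacers are skipped."""
    fields = {}
    for line in lines:
        code, _, value = line.strip().partition(" ")
        if not code or code == "XX":
            continue
        fields.setdefault(code, []).append(value.strip())
    # Continuation lines that repeat a code are joined with a single space.
    return {code: " ".join(values) for code, values in fields.items()}


record = [
    "ID AAV34335 Standard; DNA; 1495 BP.",
    "XX",
    "AC AAV34335;",
    "XX",
    "DT 27-AUG-2003 (revised)",
    "DT 01-MAR-1999 (first entry)",
    "XX",
    "DE Burkholderia casidae 16S rRNA gene.",
]
fields = parse_record(record)
print(fields["AC"])
print(fields["DT"])
```

Joining repeated codes keeps multi-line fields such as CC or PT readable as one string.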

Query Match 99.9%; Score 1493.4; DB 2; Length 1495;

Best Local Similarity 99.9%; Pred. No. 0;

Matches 1494; Conservative 0; Mismatches 1; Indels 0; Gaps 0;
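
The two percentages above can be recomputed from the counts on the same lines. The definitions used here are inferred from the numbers in this report, not quoted from the program's documentation: Query Match is taken as Score divided by the perfect score, and Best Local Similarity as Matches divided by the number of aligned columns.

```python
def query_match_pct(score, perfect_score):
    """Score as a percentage of the query's perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, mismatches, indels):
    """Identical columns as a percentage of all aligned columns."""
    return 100.0 * matches / (matches + mismatches + indels)


# Figures reported for this result.
print(round(query_match_pct(1493.4, 1495), 1))
print(round(best_local_similarity_pct(1494, 1, 0), 1))
```

Both calls round to 99.9, the values printed in the report.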

Qy     1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60

Qy    61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120

Qy   121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180

Qy   181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240

Qy   241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300

Qy   301 ACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360
         ||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||
Db   301 ACGGCCCAGACTCTTACGGGAGGGAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360

Qy   361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420

Qy   421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480

Qy   481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540

Qy   541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600

Qy   601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660

Qy   661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720

Qy   721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780

Qy   781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840

Qy   841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900

Qy   901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960

Qy   961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020

Qy  1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080

Qy  1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140

Qy  1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200

Qy  1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260

Qy  1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320

Qy  1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380

Qy  1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440

Qy  1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495



RESULT 2 
AAT01866 

ID AAT01866 standard; DNA; 1526 BP.
XX
AC AAT01866;
XX
DT 16-OCT-2003 (revised)
DT 03-AUG-1999 (first entry)
XX
DE P. cepacia 16S rRNA gene sequence.
XX
KW 16S rRNA; KK01; primer; PCR; amplification; probe; hybridisation;
KW detection; diagnosis; ds.
XX
OS Burkholderia cepacia.
XX
PN JP07255486-A.
XX
PD 09-OCT-1995.
XX
PF 23-MAR-1994; 94JP-00051739.
XX
PR 23-MAR-1994; 94JP-00051739.
XX
PA (CANO ) CANON KK.
XX
DR WPI; 1995-378541/49.
XX
PT Pseudomonas cepacia KK01 strain 16S rRNA gene - also related probes and
PT primers, useful for specific detection of P. cepacia strain KK01.
XX
PS Claim 1; Page 21; 21pp; Japanese.
XX
CC This sequence represents the 16S rRNA gene of Pseudomonas cepacia strain
CC KK01. Fragments of the nucleic acid sequence (see AAT01872-T02316) are
CC useful as primers and probes for the specific detection of P. cepacia
CC strain KK01. (Updated on 16-OCT-2003 to standardise OS field)
XX
SQ Sequence 1526 BP; 382 A; 352 C; 486 G; 306 T; 0 U; 0 Other;

Query Match 91.2%; Score 1364; DB 2; Length 1526;

Best Local Similarity 97.8%; Pred. No. 0;

Matches 1467; Conservative 0; Mismatches 25; Indels 8; Gaps 8;
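
The reported score can be rebuilt from the column counts and the gap parameters in the header (Gapop 10.0, Gapext 1.0). The sketch below rests on two assumptions that the report does not state: a match scores +1 and a mismatch -0.6 under IDENTITY_NUC (the value -0.6 is inferred because it reproduces the listed scores), and every gap pays the opening penalty once plus the extension penalty for each gapped position.

```python
def expected_score(matches, mismatches, indels, gaps,
                   match=1.0, mismatch=-0.6, gap_open=10.0, gap_ext=1.0):
    """Rebuild an alignment score from its column counts.

    match and mismatch are assumed values; gap_open/gap_ext mirror the header.
    """
    return (matches * match
            + mismatches * mismatch
            - gaps * gap_open
            - indels * gap_ext)


# Counts reported for this result: 1467 matches, 25 mismatches, 8 indels, 8 gaps.
print(round(expected_score(1467, 25, 8, 8), 1))
```

With these assumptions the result is 1364.0, the score shown on the Query Match line.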

Qy     1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
         | ||   |||||||  |||||||||||| |||||||||||||||||||||||||||||||
Db    19 AGATTGAACGCTGGCGGCATGCCTTACA-CATGCAAGTCGAACGGCAGCACGGGTGCTTG 77

Qy    61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
         ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db    78 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAAC-ATGTCCTGTAGTGGGG 136

Qy   121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   137 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 196

Qy   181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   197 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 256

Qy   241 CTACCAAGGCGACGATCAGTAG-TTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 299
         |||||||||||||||||||||| | |||||||||||||||||||||||||||||||||||
Db   257 CTACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 316

Qy   300 CACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 359
         |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db   317 CACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 376

Qy   360 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 419
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   377 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 436

Qy   420 AAGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 479
         ||||||||||||| ||||||| || |||||||||||||||||||||||||||||||||||
Db   437 AAGAAATCCTTGGCTCTAATACAGTCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 496

Qy   480 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGG 539
         |||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
Db   497 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCAAGCGTTAATCGGAATTACTGG 556

Qy   540 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 599
         ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   557 GCGTAAAGCGTGCGCAGGCGGTTTGTTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 616

Qy   600 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 659
         ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Db   617 GAACTGCATTGGTGACTGGCAAGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 676

Qy   660 GCAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 719
         ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   677 GCAGTGAAATGCGTAGAGATGTGGAGGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 736

Qy   720 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 779
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   737 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 796

Qy   780 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 839
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   797 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 856

Qy   840 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 899
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   857 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 916

Qy   900 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 959
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   917 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 976

Qy   960 CCTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1019
         |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db   977 CCTTGACATGGTCGGAATCCTGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1036

Qy  1020 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1079
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1037 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1096

Qy  1080 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1139
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1097 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1156

Qy  1140 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGC-TCACACGT 1198
         ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db  1157 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTTCACACGT 1216

Qy  1199 CATACAATGGTCGGAACAGAGGGTTGCC-ACCCGCGAAGGGGAGCTAATCCCAGAAAACC 1257
         |||||||||||||||||||||||||||| |||||||| ||||||||||||||||||||||
Db  1217 CATACAATGGTCGGAACAGAGGGTTGCCAACCCGCGAGGGGGAGCTAATCCCAGAAAACC 1276

Qy  1258 GATCGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATC 1317
         |||||||||||||||||||||||||| |||||||||||||||||||||||||||||||||
Db  1277 GATCGTAGTCCGGATTGCACTCTGCAACTCGAGTGCATGAAGCTGGAATCGCTAGTAATC 1336

Qy  1318 GCGGATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACC 1377
         |||||||||||||||||||||||||| ||||||||| |||||||||||||||||||||||
Db  1337 GCGGATCAGCATGCCGCGGTGAATACGTTCCCGGGTCTTGTACACACCGCCCGTCACACC 1396

Qy  1378 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGG 1437
         ||||||||||||||||||||||||||||||||||||||||||| || |||||| ||||||
Db  1397 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGG-AGGACGGTCACCACGG 1455

Qy  1438 TAGGATTCATGACT-GGGTGAAGTCGTAACAA-GTAGCCGTATCCGAAAGTTCGGCTGGA 1495
         |||||||||||||| ||||||||||||||||| ||||||||||| ||| || ||||||||
Db  1456 TAGGATTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGTGCGGCTGGA 1515



RESULT 3 
AAV24295 

ID AAV24295 standard; DNA; 1535 BP.
XX
AC AAV24295;
XX
DT 14-SEP-1998 (first entry)
XX
DE Burkholderia cepacia 16S ribosomal RNA gene.
XX
KW Antibacterial; antimycobacterial; oligonucleotide; infection; therapy;
KW ribosome binding site; Shine-Dalgarno; ribosomal RNA; cystic fibrosis;
KW tuberculosis; ss.
XX
OS Burkholderia cepacia.
XX
PN WO9814567-A2.
XX
PD 09-APR-1998.
XX
PF 30-SEP-1997; 97WO-US018094.
XX
PR 01-OCT-1996; 96US-0027729P.
XX
PA (ADRE-) ADVANCED RES & TECHNOLOGY INST.
XX
PI Martin WJ, Wisniowski P;
XX
DR WPI; 1998-240079/21.
XX
PT Use of oligonucleotide(s) corresponding to bacterial 16S rRNA - for
PT inhibiting bacterial protein expression and treating bacterial infection.
XX
PS Claim 26; Page 62-63; 73pp; English.
XX
CC This polynucleotide comprises the 16S ribosomal RNA (rRNA) gene of
CC Burkholderia cepacia. The invention relates to methods and compositions
CC for the treatment of Gram-negative bacterial infections employing novel
CC oligonucleotides as antimicrobial agents. The oligonucleotides are
CC targeted to the Shine-Dalgarno (SD) region of prokaryotes to inhibit
CC bacterial expression and hence inhibit bacterial infection. They
CC preferably comprise 10-35 consecutive bases of the 3' end of a bacterial
CC 16S rRNA (see also AAV24291-94). An oligonucleotide may also include a
CC transport moiety and may have DNA phosphate modifications to increase
CC nuclease resistance, or may be formulated in a liposome. A claimed method
CC for treating a bacterial infection of a patient comprises administering a
CC liposomal formulation of such an oligonucleotide. The oligonucleotides
CC can be used particularly for treating bacterial infections in pulmonary
CC diseases such as cystic fibrosis or tuberculosis. Since the SD sequence
CC is not present in eukaryotic cells, the oligonucleotides provide a
CC pathogen-specific therapeutic method
XX

SQ Sequence 1535 BP; 385 A; 355 C; 488 G; 307 T; 0 U; 0 Other; 
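
The SQ line carries its own consistency check: the per-base counts should add up to the stated length. A minimal sketch of that check follows; `parse_sq_line` is a hypothetical helper, and the regular expression assumes the "count SYMBOL;" layout seen in the SQ lines of this report.

```python
import re


def parse_sq_line(line):
    """Return (length, counts) from an SQ line such as the one above."""
    # Every "<number> <label>;" pair is captured; the "BP" pair is the length.
    pairs = {label: int(number)
             for number, label in re.findall(r"(\d+)\s+([A-Za-z]+);", line)}
    length = pairs.pop("BP")
    return length, pairs


sq = "SQ Sequence 1535 BP; 385 A; 355 C; 488 G; 307 T; 0 U; 0 Other;"
length, counts = parse_sq_line(sq)
print(length, counts)
print(sum(counts.values()) == length)
```

A failed check would point to a damaged count or length in the transcribed line.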

Query Match 90.2%; Score 1348.8; DB 2; Length 1535;

Best Local Similarity 97.6%; Pred. No. 0;

Matches 1464; Conservative 0; Mismatches 27; Indels 9; Gaps 9;

Qy     1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
         | ||   |||||||  |||||| | ||| |||||||||||||||||||||||||||||||
Db    27 AGATTGAACGCTGGCGGCATGCTTAACA-CATGCAAGTCGAACGGCAGCACGGGTGCTTG 85

Qy    61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
         ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db    86 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAAC-ATGTCCTGTAGTGGGG 144

Qy   121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   145 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 204

Qy   181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
         ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   205 TTCGGGCCTCGCGCTATAGGGTTGG-CGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 263

Qy   241 CTACCAAGGCGACGATCAGTAG-TTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 299
         |||||||||||||||||||||| | |||||||||||||||||||||||||||||||||||
Db   264 CTACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 323

Qy   300 CACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 359
         |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db   324 CACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 383

Qy   360 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 419
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   384 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 443

Qy   420 AAGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 479
         ||||||||| ||| ||||||| || |||||||||||||||||||||||||||||||||||
Db   444 AAGAAATCCCTGGCTCTAATACAGTCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 503

Qy   480 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGG 539
         |||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
Db   504 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCAAGCGTTAATCGGAATTACTGG 563

Qy   540 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 599
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   564 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 623

Qy   600 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 659
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   624 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 683

Qy   660 GCAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 719
         ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   684 GCAGTGAAATGCGTAGAGATGTGGAGGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 743

Qy   720 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 779
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   744 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 803

Qy   780 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 839
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   804 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 863

Qy   840 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 899
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   864 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 923

Qy   900 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 959
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   924 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 983

Qy   960 CCTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1019
         |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db   984 CCTTGACATGGTCGGAATCCTGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1043

Qy  1020 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1079
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1044 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1103

Qy  1080 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1139
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1104 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1163

Qy  1140 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGC-TCACACGT 1198
         ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db  1164 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTTCACACGT 1223

Qy  1199 CATACAATGGTCGGAACAGAGGGTTGCC-ACCCGCGAAGGGGAGCTAATCCCAGAAAACC 1257
         |||||||||||||||||||||||||||| |||||||| ||||||||||||||||||||||
Db  1224 CATACAATGGTCGGAACAGAGGGTTGCCAACCCGCGAGGGGGAGCTAATCCCAGAAAACC 1283

Qy  1258 GATCGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATC 1317
          ||||||||||||||||||||||||| |||||||||||||||||||||||||||||||||
Db  1284 CATCGTAGTCCGGATTGCACTCTGCAACTCGAGTGCATGAAGCTGGAATCGCTAGTAATC 1343

Qy  1318 GCGGATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACC 1377
         |||||||||||||||||||||||||| ||||||||| |||||||||||||||||||||||
Db  1344 GCGGATCAGCATGCCGCGGTGAATACGTTCCCGGGTCTTGTACACACCGCCCGTCACACC 1403

Qy  1378 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGG 1437
         ||||||||||||||||||||||||||||||||||||||||||| || |||||| ||||||
Db  1404 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGG-AGGACGGTCACCACGG 1462

Qy  1438 TAGGATTCATGACT-GGGTGAAGTCGTAACAA-GTAGCCGTATCCGAAAGTTCGGCTGGA 1495
         |||||||||||||| ||||||||||||||||| ||||||||||| ||| || ||||||||
Db  1463 TAGGATTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGTGCGGCTGGA 1522
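
The start and end coordinates on each alignment row are tied together by the number of residues on the row: gap characters ("-") occupy a column but do not advance the coordinate. A minimal sketch of that bookkeeping is given below, using the Qy 241 / Db 264 block of this result as the example; `end_coordinate` is a hypothetical helper name.

```python
def end_coordinate(start, aligned_row):
    """Coordinate of the last residue on a row; '-' columns are not counted."""
    residues = sum(1 for ch in aligned_row if ch != "-")
    return start + residues - 1


qy_row = "CTACCAAGGCGACGATCAGTAG-TTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA"
db_row = "CTACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA"

print(end_coordinate(241, qy_row))  # 299
print(end_coordinate(264, db_row))  # 323
```

The same rule explains why the Qy row of a 60-column block ends one position early whenever it contains a gap.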



RESULT 4 
ADB61689 

ID ADB61689 standard; DNA; 1535 BP.
XX
AC ADB61689;
XX
DT 04-DEC-2003 (first entry)
XX
DE 16S rRNA of Burkholderia cepacia DNA sequence.
XX
KW enriching mRNA; high quality bacterial mRNA; bacterial gene expression;
KW poly-A tail; mRNA purification; oligo-dT capture;
KW prokaryote mRNA purification; bridging oligonucleotide; targeting region;
KW capture oligonucleotide; prokaryotic 16S rRNA; prokaryotic 23S rRNA;
KW eukaryotic 17S rRNA; eukaryotic 18S rRNA; eukaryotic 28S rRNA;
KW 28S eukaryotic rRNA bridging oligonucleotide; ds.
XX
OS Burkholderia cepacia.
XX
PN WO2003054162-A2.
XX
PD 03-JUL-2003.
XX
PF 19-DEC-2002; 2002WO-US041014.
XX
PR 20-DEC-2001; 2001US-00029397.
XX
PA (AMBI-) AMBION INC.
XX
PI Murphy GL, Whitley JP;
XX
DR WPI; 2003-663255/62.
XX
PT Depleting or isolating targeted nucleic acids e.g. rRNA, involves using a
PT bridging oligonucleotide comprising bridging region and a targeting
PT region complementary to a targeted nucleic acid, and a capture
PT oligonucleotide.
XX
PS Claim 4; Page 173-174; 208pp; English.
XX
CC This invention relates to a novel method for isolating, depleting or
CC separating a targeted nucleic acid, such as rRNA, from a sample
CC comprising targeted and non-targeted nucleic acids. It effects a way of
CC enriching for non-targeted nucleic acids such as mRNAs. Isolating
CC sufficient quantities of high quality bacterial mRNA is a demanding
CC process which impedes analysis of bacterial gene expression in the
CC presence of host cells. A small percentage of bacterial mRNAs may be poly
CC -A tailed, but these are targeted for degradation and tend to be
CC unstable. As a result, the commonly employed method for mRNA purification
CC with eukaryotic cells, oligo-dT capture, is ineffective. The present
CC invention provides an alternative, more suitable method for mRNA
CC purification from prokaryotes. The method of the invention comprises the
CC incubation of a sample with a bridging oligonucleotide (containing a
CC targeting region) and subsequently incubating with a capture
CC oligonucleotide allowing the isolation of the target from the sample. The
CC method is useful for depleting or isolating targeted nucleic acid, for
CC example rRNA such as prokaryotic 16S or prokaryotic 23S, eukaryotic 17S
CC or 18S or eukaryotic 28S rRNA, from a sample. The rRNA sequence may
CC comprise any one of 64 fully defined sequences as given in the
CC specification. The present sequence is that of a DNA sequence which
CC represents the sequence of 16S rRNA of Burkholderia cepacia related to
CC the invention.
XX

SQ Sequence 1535 BP; 385 A; 355 C; 488 G; 307 T; 0 U; 0 Other; 

Query Match 90.2%; Score 1348.8; DB 10; Length 1535; 

Best Local Similarity 97.6%; Pred. No. 0; 

Matches 1464; Conservative 0; Mismatches 27; Indels 9; Gaps 9; 
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Qy 1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60 

I II ' lllllll I II II I I III III MM II II MM II II II II II II II II 

Db 27 AGATTGAACGCTGGCGGCATGCTTAACA-CATGCAAGTCGAACGGCAGCACGGGTGCTTG 8 5 

Qy 61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 12 0 

Ml II II II II I! II MM Mill II MINIM MINIMI 1 1 1 1 II I II 1 1 1 1 1 1 1 

Db 86 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAAC-ATGTCCTGTAGTGGGG 14 4 

Qy 121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 18 0 

III MM II MM MM MM Ml MINIM MINI II MM II II II II MM II II 

Db 145 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 204 

Qy 181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 24 0 

Ml II MINI II II MM Mill I MINI II IIIIIIMM II MM II II II II II 

Db 2 05 TTCGGGCCTCGCGCTATAGGGTTGG-CGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 263 

Qy 241 CTACCAAGGCGACGATCAGTAG-TTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 299 

I II II M II II II II II INI I I I MM II MM II II II II I! II II MM II II II ■ - 

Db 2 64 CTACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 32 3 

Qy 3 00 CACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 359 

1 1 1 1 I i 1 1 1 1 1 1 I II MM Ml II INI II II MM II MM II II II II INI INI 

Db 3 24 CACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 38 3 

Qy 3 60 GAT CC AG C A AT GC CG CG TG TG TGA AG AAGG CC TT CG GG TT GT AA AG CA CT TT TG TC CG GA 419 

IINI II II II II II II II II III II II MINI MM II II II II INI II II I! II II 

Db 3 84 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 44 3 

Qy 4 20 AAGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 47 9 

MINIMI III MIMM II III M II II II INI II II II M M M II II II II 

Db 4 44 AAGAAATCCCTGGCTCTAATACAGTCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 503 

Qy 4 80 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGG 53 9 

llllllllllllllllllllllllllllllllllllll 1 1 f 1 1 f 1 1 1 1 1 1 1 1 1 1 f 1 1 i I 

Db 5 04 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCAAGCGTTAATCGGAATTACTGG 56 3 

Qy 540 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 599 

1 1 1 1 f 1 1 1 i 1 1 1 1 1 1 f 1 1 i 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 j j i 1 1 1 j 

Db 5 64 G CG T A AAGC GT GC GC AG GC GG TTT GC TA AG AC CG AT GT GA AATC CC CG GG CT C A AC CT GG 62 3 

Qy 6 00 G AACT GC AT TG GTGA CT GG CAGGC TAGAGT AT GG CAGAGG GG GG TAGAAT TC CA CGTGTA 65 9 

I I I I I I I I I I II I I I I I I II I M I I I I I I II M II II I I M II II II II I I II II II I M 
Db 624 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCA CGTGTA 68 3 

Qy 6 60 GCAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 719 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 llllllllllll IIIIIMIIIIIIMI IIMII 

Db 6 84 G CAGT GAAATG CGTAGAGATGTGG AGGA AT AC CG AT GG CG AAGG CAGC CC CCTGGG CC AA 74 3 

Qy 7 20 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 77 9 

M M M I I I I I I II II II II I I I M II I I II II M II I I I I II I I I I II II I I I I II II I 
Db 744 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 803 

Qy 7 80 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 83 9 

IINI II II II II II II II II III MINI II MINI M II INI II II II II INI II 

Db 8 04 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 86 3 

Qy 8 40 G TG AAGT TG AC CG CC TG GG GAGT A CG GT CG C A AG AT T A AA AC TC AA AG GA AT TG AC GG GG 89 9 

III INI M II MINI II II III II II MINI MM II II MM II II II II II II II 

Db 8 64 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 92 3 

Qy 9 00 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 95 9 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 924 ACCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTAC 98 3 

Qy 960 CCTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1019 

I I I I I I I II I I I I I I II I I I I I I II I I I I I I I I I I II I I I I I I I I II II I I I I I I I I II 
Db 9 84 CCTTGACATGGTCGGAATCCTGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1043 

Qy 1020 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1079 

I I I I I II I I I I I I I I I I I I I I II I I I I I I II I I I I I II I I II I I I I I I I I II I I I I II I I 
Db 1044 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1103 

Qy 10 80 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1139 

1 1 1 1 1 M 1 1 II 1 1 1 1 M 1 1 1 1.M II 1 1 1 II 1 1 II 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 

Db 1104 C GC AACC CTTGTC CTTAGTTG CTA CG CAAG AG CACT CT AAGG AG AC TG CCGGTG AC AAAC 1163 

Qy 1140 C GG AG GAAG GTGG GG AT GA CG TCA AG TC CT CATG GC CC TT AT GG GT AG GG C - TCA CA CG T 1198 

I II II II II II 1 1 II II II II III II II II II MM II II II II 1 1 II II I MM 1 1 II 

Db 1164 C GG AG GAAG GTGG GG AT GA CG TCA AG TC CT CATG GC CC TT AT GG GT AG GG CT TC AC AC GT 12 23 

Qy 1199 CATACAATGGTCGGAACAGAGGGTTGCC -ACCCGCGAAGGGGAGCTAATCCCAGAAAACC 1257 

I II II II II II II II II MM Ml II II Mill II I MM II II II MM II MM II 

Db 12 24 CATACAATGGTCGGAACAGAGGGTTGCCAACCCGCGAGGGGGAGCTAATCCCAGAAAACC 12 83 

Qy 12 58 GATCGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATC 1317 

II II M M M II II M 1 1 1 1 II I II I II 1 1 M 1 1 M 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 12 84 CAT CGTAGT CCGG ATTG CACT CTG CAACTC GAGT GC AT GAAG CT GGAATCGC TAGT AATC 13 43 

Qy 1318 G CG GATC AG CATG CC GC GG TG AAT AC TT TC CC GG GT TT TG TA C A C A CC GC CC GT C A C A CC 13 77 

II 'M! || || || || || Ml! II: ! IMIIIIM II I I I II I I 1 1 II 1 1 II II 1 1 M 

Db 13 44 G CG GATC AG CATG CC GC GG TG AAT AC GT TC CC GG GT CT TG TA CA C A CC GC CC GT C A C A CC 14 03 

Qy 13 78 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGG 14 37 

•I II II II II II II II II II II Ml II II II II II II II II II I II IMMI MINI 

Db 14 04 ATGGG AGTGGGTT TT AC CAGAAGT GG CT AGTCTAAC CG CAAGG - AGG AC GGTC ACCACGG 14 62 

Qy 14 38 TAGGATTCATGACT-GGGTGAAGTCGTAACAA-GTAGCCGTATCCGAAAGTTCGGCTGGA 1495 

I II II II II MM I II II Mill II II MM III MM MM III II MM MM 

Db 14 63 TAGGATTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGTGCGGCTGGA 1522 



RESULT 5 
ADB61690 

ID ADB61690 standard; DNA; 1488 BP. 
XX 

AC ADB61690;
XX 

DT 04-DEC-2003 (first entry) 
XX 

DE 16S rRNA of Burkholderia mallei DNA sequence.
XX 

KW enriching mRNA; high quality bacterial mRNA; bacterial gene expression;

KW poly-A tail; mRNA purification; oligo-dT capture; 

KW prokaryote mRNA purification; bridging oligonucleotide; targeting region;

KW capture oligonucleotide; prokaryotic 16S rRNA; prokaryotic 23S rRNA; 

KW eukaryotic 17S rRNA; eukaryotic 18S rRNA; eukaryotic 28S rRNA;

KW 28S eukaryotic rRNA bridging oligonucleotide; ds. 

XX 

OS Burkholderia mallei. 
XX

PN WO2003054162-A2. 
XX 
PD 03-JUL-2003.
XX 

PF 19-DEC-2002; 2002WO-US041014.
XX

PR 20-DEC-2001; 2001US-00029397.
XX

PA (AMBI-) AMBION INC.
XX 

PI Murphy GL, Whitley JP; 
XX 

DR WPI; 2003-663255/62. 
XX 

PT Depleting or isolating targeted nucleic acids e.g. rRNA, involves using a 

PT bridging oligonucleotide comprising bridging region and a targeting 

PT region complementary to a targeted nucleic acid, and a capture 

PT oligonucleotide. 

XX 

PS Claim 4; Page 174; 208pp; English.
XX 

CC This invention relates to a novel method for isolating, depleting or 

CC separating a targeted nucleic acid, such as rRNA, from a sample 

CC comprising targeted and non-targeted nucleic acids. It effects a way of

CC enriching for non-targeted nucleic acids such as mRNAs. Isolating 

CC sufficient quantities of high quality bacterial mRNA is a demanding 

CC process which impedes analysis of bacterial gene expression in the 

CC presence of host cells. A small percentage of bacterial mRNAs may be poly

CC -A tailed, but these are targeted for degradation and tend to be 

CC unstable. As a result, the commonly employed method for mRNA purification 

CC with eukaryotic cells, oligo-dT capture, is ineffective. The present

CC invention provides an alternative, more suitable method for mRNA 

CC purification from prokaryotes. The method of the invention comprises the 

CC incubation of a sample with a bridging oligonucleotide (containing a 

CC targeting region) and subsequently incubating with a capture 

CC oligonucleotide allowing the isolation of the target from the sample. The

CC method is useful for depleting or isolating targeted nucleic acid, for 

CC example rRNA such as prokaryotic 16S or prokaryotic 23S, eukaryotic 17S

CC or 18S or eukaryotic 28S rRNA, from a sample. The rRNA sequence may

CC comprise any one of 64 fully defined sequences as given in the

CC specification. The present sequence is that of a DNA sequence which 

CC represents the sequence of 16S rRNA of Burkholderia mallei related to the 

CC invention. 

XX 

SQ Sequence 1488 BP; 373 A; 344 C; 480 G; 291 T; 0 U; 0 Other; 

Query Match 87.8%; Score 1312.6; DB 10; Length 1488;

Best Local Similarity 96.4%; Pred. No. 0;

Matches 1439; Conservative 0; Mismatches 44; Indels 10; Gaps 9;
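
The two percentages in the summary above can be recomputed from the other printed values. The sketch below assumes, as an inference from the numbers in this report rather than from GenCore documentation, that Query Match is the score divided by the perfect score and that Best Local Similarity is matches divided by aligned columns (matches + mismatches + indels). The function name `summary_stats` is hypothetical.

```python
def summary_stats(score, perfect_score, matches, mismatches, indels):
    """Recompute the percentages shown in a result summary.

    Assumed definitions (inferred from the printed values, not documented here):
      Query Match           = score / perfect score
      Best Local Similarity = matches / (matches + mismatches + indels)
    """
    query_match = 100.0 * score / perfect_score
    aligned_columns = matches + mismatches + indels
    best_local_similarity = 100.0 * matches / aligned_columns
    return round(query_match, 1), round(best_local_similarity, 1)


# Values from RESULT 5 (ADB61690) against query US-10-776-767-1.
print(summary_stats(1312.6, 1495, 1439, 44, 10))  # (87.8, 96.4)
```

Under these assumptions the RESULT 5 figures (87.8% and 96.4%) are reproduced exactly, which is a useful sanity check when OCR has split or altered digits in a summary line.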
Qy 1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60 

I II lllllll IIIIIIIMIII llllllilllllllllllllMIII Ml 

Db 1 AGATTGAACGCTGGCGGCATGCCTTACA-CATGCAAGTCGAACGGCAGCACGGG - - CTTC 57 

Qy 61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 12 0 

1 1 II M II II 1 1 1 1 1 Ml III II II II II MM II MM I MM Mill I MM II 

Db 58 GGCCTGGTGGCGAGTGGTGAACGGGTGAGTAATACATCGGAAC-ATGTCCTGTAGTGGGG 116 

Qy 121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 18 0 

1 1 1 1 1 M M 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 

Db 117 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTGAGGATGAAAGCGGGGGACC 17 6 

Qy 181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 24 0 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 177 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 23 6 

Qy 241 CTACCAAGGCGACGATCAGTAG-TTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGA 29 9 

Mill II II II II II MM II! I II MM II IIMMM II II M MINIM MM 

Db 2 37 C TACC AAGG CG ACGATC AGTAGCT GGTC TG AG AGGACG AC CAGC CACACT GGGACTGAGA 296 

Qy 3 00 CACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCT 35 9 

IMMMMIIMI M II II III II M II II M II MM M II II M I! II I MINI 

Db 2 97 CACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGCAAGCCT 35 6 

Qy 3 60 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA 419 

Ml II MINIM II MM II III MM II II M IIMMM II II II MM MM MM 

Db 3 57 GATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGA. 416 

Qy 4 20 AAGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGC 47 9 

llllllll II MINI I I II II M MM IMI M M M II II M M II 

Db 417 AAGAAATCATTCTGGCTAATACCCGGAGTGGATGACGGTACCGGAAGAATAAGCACCGGC 476 

Qy 4 80 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGG 53 9 

Ml II MM II II II MM Mill II II II INI II MMM II II II lllllllllll 

Db 4 77 TAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATTGGAATTACTGG 53 6 

Qy 5 40 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG 59 9 

I MM II INI MM MM II III II II M II INI INI I! MM II M II II II M II 

Db 537 GCGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGG. 596 

Qy 6 00 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 65 9 

MNMI M II M II IIMIMM IMI II II IMI II II MINI II II II I! INI II 

Db 5 97 GAACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTA 65 6 

Qy 6 60 GCAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAA 719 

Ml MINIMI! II MM Mill I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 657 GCAGTGAAATGCGTA GAGA TGTGGAGGAAT AC CG AT GGCG AAGG CAGC CCCCTGGGCCAA 716 

Qy 7 20 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 77 9 

I II II II M M 1 1 II II II II III II II II II II II II II II II II 1 1 II II M M II II 

Db 717 TACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCA 77 6 

Qy 780 C GC CC TAAACG ATGT CAAC TAGTT GTTGGGGATT CATT TC CT TAGT AACG TAGCTAAC GC 83 9 

III INI II II MM MINIMI MINNIMM INIIIMM II II M II MINI 

Db 7 77 CGCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGC 83 6 

Qy 8 40 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 89 9 

I II II II II II 1 1 II II II II III II II II II II II II II II II II II II II II II MM 

Db 837 GTGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGG 896 

Qy 9 00 ACC CG CACAAG CGGT GGATGATGT GGAT TAAT TCGATG CAACGCGAAAAACCTT AC CT AC 95 9 

I II II II II II II II II II II III II II II II II II II II Mil II MM II II II II II 

Db 8 97 ACC CG CACAAG CGGT GGAT GATGT GGAT TAAT TC GATG CAACGCGAAAAACC TT AC CT AC 95 6 

Qy 9 60 CCTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACA 1019 

I II II II II II II II II 1 1 1 1 I Mill MM I II IMI II II II 1 1 II II II II II 

Db 957 CCTTGACATGGTCGGAAGCCCGATGAGAGTTGGGCGTGCTCGAAAGAGAACCGGCGCACA 1016 

Qy 1020 GGTGCTGCATGGCTGTCGTCAGCT CGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 1079 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1017 GGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAG 10 76 

Qy 1080 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1139 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1077 CGCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAAC 1136

Qy 1140 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGC-TCACACGT 1198 

I II II II II II II MM II II I II II MM II II II II II II II II II II I MINIM. 

Db 1137 CGGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTTCACACGT 1196 

Qy 1199 C AT AC AATGGT CG GAAC AG AGGGT TG CC - ACC CG CG AAGGGG AG CT AATC CC AG AAAA CC 1257 

I II II II M II II II M MM Ml III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II M Ml I M 

Db 1197 CATACAATGGTCGGAACAGAGGGTCGCCAACCCGCGAGGGGGAGCCAATCCCAGAAAACC 12 56 

Qy 12 58 GAT CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGT AATC 1317 

III II IIIIMM llllll II III M I II M II M II M II II II II 11 II II II II II 

Db 12 57 GAT CGTAGTCCGGATTGCACTCTGCAACTCGAGTGCATGAAGCTGGAATCGCTAGT AATC 1316 

Qy 1318 GCGGATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACC 13 77 

Ml MM II MM II MM II III M I 1 1 II M 11 I II II II II II M II II 1 1 1 1 1 1 

Db 1317 GCGGATCAGCATGCCGCGGTGAATACGTTCCCGGGTCTTGTACACACCGCCCGTCACACC 13 76 

Qy 13 78 ATGGGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGG 14 37 

I II II II II II II II II II II IM II II MM MM M MM I II llllll llllll 

Db 13 77 ATG GG AG TGGGTT TT AC CAGA AGT GG CT AG TC TA AC CG CA AG G - AGG AC GG TC AC CA CG G 14 35 

Qy 14 38 TAGGATTCATGACT- GGGTGAAGTCGTAACAA - GTAGCCGTATCCGAAAGTTC 14 88 

IIIIIIIIIIIMI I ! 1 1 1 1 1 1 1 I! 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 III II I 

Db 14 36 TAGGATTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGTGC 14 88 



RESULT 6 
ADB61691 

ID ADB61691 standard; DNA; 1610 BP.
XX

AC ADB61691;
XX 

DT 04-DEC-2003 (first entry) 
XX 

DE 16S rRNA of Burkholderia pseudomallei DNA sequence. 
XX 

KW enriching mRNA; high quality bacterial mRNA; bacterial gene expression; 
KW poly-A tail; mRNA purification; oligo-dT capture; 

KW prokaryote mRNA purification; bridging oligonucleotide; targeting region; 

KW capture oligonucleotide; prokaryotic 16S rRNA; prokaryotic 23S rRNA; 

KW eukaryotic 17S rRNA; eukaryotic 18S rRNA; eukaryotic 28S rRNA; 

KW 28S eukaryotic rRNA bridging oligonucleotide; ds. 

XX 

OS Burkholderia pseudomallei. 
XX 

PN WO2003054162-A2. 
XX 

PD 03-JUL-2003. 
XX 

PF 19-DEC-2002; 2002WO-US041014.
XX

PR 20-DEC-2001; 2001US-00029397.
XX

PA (AMBI-) AMBION INC.
XX

PI Murphy GL, Whitley JP;
XX 

DR WPI; 2003-663255/62. 
XX 

PT Depleting or isolating targeted nucleic acids e.g. rRNA, involves using a 
PT bridging oligonucleotide comprising bridging region and a targeting

PT region complementary to a targeted nucleic acid, and a capture 

PT oligonucleotide. 
XX 

PS Claim 4; Page 175; 208pp; English.
XX 

CC This invention relates to a novel method for isolating, depleting or 

CC separating a targeted nucleic acid, such as rRNA, from a sample 

CC comprising targeted and non-targeted nucleic acids. It effects a way of 

CC enriching for non-targeted nucleic acids such as mRNAs. Isolating

CC sufficient quantities of high quality bacterial mRNA is a demanding 

CC process which impedes analysis of bacterial gene expression in the

CC presence of host cells. A small percentage of bacterial mRNAs may be poly 

CC -A tailed, but these are targeted for degradation and tend to be 

CC unstable. As a result, the commonly employed method for mRNA purification 

CC with eukaryotic cells, oligo-dT capture, is ineffective. The present 

CC invention provides an alternative, more suitable method for mRNA 

CC purification from prokaryotes. The method of the invention comprises the 

CC incubation of a sample with a bridging oligonucleotide (containing a 

CC targeting region) and subsequently incubating with a capture 

CC oligonucleotide allowing the isolation of the target from the sample. The 

CC method is useful for depleting or isolating targeted nucleic acid, for 

CC example rRNA such as prokaryotic 16S or prokaryotic 23S, eukaryotic 17S

CC or 18S or eukaryotic 28S rRNA, from a sample. The rRNA sequence may

CC comprise any one of 64 fully defined sequences as given in the 

CC specification. The present sequence is that of a DNA sequence which 

CC represents the sequence of 16S rRNA of Burkholderia pseudomallei related 

CC to the invention. 

XX 

SQ Sequence 1610 BP; 394 A; 383 C; 514 G; 319 T; 0 U; 0 Other;



Query Match 87.2%; Score 1303; DB 10; Length 1610;

Best Local Similarity 96.3%; Pred. No. 0;

Matches 1419; Conservative 0; Mismatches 45; Indels 9; Gaps 8; 
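
The SQ line of a record lists the sequence length followed by per-base counts, and the counts should sum to the length (for this record, 394 + 383 + 514 + 319 = 1610). A minimal sketch of how such a line could be produced from a raw sequence is shown here; the helper names `base_composition` and `sq_line` are hypothetical, and the output layout simply mirrors the SQ lines in this report.

```python
from collections import Counter


def base_composition(seq):
    """Count A, C, G, T, U and everything else in a nucleotide string."""
    counts = Counter(seq.upper())
    comp = {base: counts.get(base, 0) for base in "ACGTU"}
    # Anything that is not one of the five named bases is reported as "Other".
    comp["Other"] = len(seq) - sum(comp.values())
    return comp


def sq_line(seq):
    """Format the composition in the style of the SQ lines shown in this report."""
    comp = base_composition(seq)
    fields = "; ".join(f"{comp[k]} {k}" for k in ("A", "C", "G", "T", "U", "Other"))
    return f"SQ Sequence {len(seq)} BP; {fields};"


# First 20 bases of the query US-10-776-767-1.
print(sq_line("AAATATTACGCTGGTTGCAT"))
# SQ Sequence 20 BP; 6 A; 3 C; 4 G; 7 T; 0 U; 0 Other;
```

Comparing the summed counts with the stated length is a quick way to spot OCR damage in an SQ line, such as a digit zero read as the letter O.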

Qy      1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
          | ||   |||||||  |||||||||||| |||||||||||||||||||||||||  |||
Db     80 AGATTGAACGCTGGCGGCATGCCTTACA-CATGCAAGTCGAACGGCAGCACGGG--CTTC 136

Qy     61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120



OM nucleic - nucleic search, using sw model



Run on:         January 13, 2007, 15:19:33 ; Search time 7967 Seconds
                (without alignments)
                10969.686 Million cell updates/sec

Title:          US-10-776-767-1
Perfect score:  1495
Sequence:       1 AAATATTACGCTGGTTGCAT...TATCCGAAAGTTCGGCTGGA 1495

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       86534536 seqs, 29229259966 residues

Minimum DB seq length: 0

Post-processing: Minimum Match 0%
                 Maximum Match 100%
77: /EMC_Celerra_SIDS3/ptodata/2/pna/US600_COMB.seq:*
78: /EMC_Celerra_SIDS3/ptodata/2/pna/US601_COMB.seq:*
79: /EMC_Celerra_SIDS3/ptodata/2/pna/US602A_COMB.seq:*
80: /EMC_Celerra_SIDS3/ptodata/2/pna/US602B_COMB.seq:*
81: /EMC_Celerra_SIDS3/ptodata/2/pna/US603_COMB.seq:*
82: /EMC_Celerra_SIDS3/ptodata/2/pna/US604A_COMB.seq:*
83: /EMC_Celerra_SIDS3/ptodata/2/pna/US604B_COMB.seq:*
84: /EMC_Celerra_SIDS3/ptodata/2/pna/US605_COMB.seq:*
85: /EMC_Celerra_SIDS3/ptodata/2/pna/US606_COMB.seq:*
86: /EMC_Celerra_SIDS3/ptodata/2/pna/US607_COMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution.
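
The report header names an "sw model" with gap parameters Gapop 10.0 and Gapext 1.0. As a rough illustration only, the sketch below computes a Smith-Waterman local alignment score with affine gaps using the standard Gotoh recurrences. It is not the GenCore implementation: the function name `sw_affine_score` is hypothetical, the match and mismatch values stand in for the IDENTITY_NUC table (whose actual entries are not given in this report), and the convention that a gap of length k costs gap_open + k * gap_ext is an assumption.

```python
def sw_affine_score(query, target, match=1.0, mismatch=0.0,
                    gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (Gotoh recurrences).

    Assumptions: match/mismatch are placeholders for the scoring table, and
    a gap of length k costs gap_open + k * gap_ext.
    """
    neg = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)  # H values for the previous query row
    f_prev = [neg] * (n + 1)  # best scores ending in a gap that skips query bases
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg] * (n + 1)
        e = neg  # best score ending in a gap that skips target bases
        for j in range(1, n + 1):
            # Open a new gap from H, or extend the existing one.
            e = max(h_cur[j - 1] - gap_open - gap_ext, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open - gap_ext, f_prev[j] - gap_ext)
            sub = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: never let a cell drop below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + sub, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


# An identical pair scores one point per base under these placeholder values.
print(sw_affine_score("AAATATTACGCTGGTTGCAT", "AAATATTACGCTGGTTGCAT"))  # 20.0
```

With these placeholder values a query aligned to an identical database sequence scores its own length, which agrees with the "Perfect score: 1495" reported for the 1495-base query.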



GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         January 13, 2007, 14:34:41 ; Search time 8795 Seconds
                (without alignments)
                10869.971 Million cell updates/sec

Title:          US-10-776-767-1
Perfect score:  1495
Sequence:       1 AAATATTACGCTGGTTGCAT...TATCCGAAAGTTCGGCTGGA 1495

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       6366136 seqs, 31973710525 residues

Total number of hits satisfying chosen parameters:      12732272

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      GenEmbl:*
                1:  gb_env:*
                2:  gb_pat:*
                3:  gb_ph:*
                4:  gb_pl:*
                5:  gb_pr:*
                6:  gb_ro:*
                7:  gb_sts:*
                8:  gb_sy:*
                9:  gb_un:*
                10:  gb_vi:*
                11:  gb_ov:*
                12:  gb_htg:*
                13:  gb_in:*
                14:  gb_om:*
                15:  gb_ba:*




Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
AR178244 

LOCUS       AR178244                1495 bp    DNA     linear   PAT 20-APR-2002
DEFINITION  Sequence 1 from patent US 6319497.
ACCESSION   AR178244
VERSION     AR178244.1  GI:20219382
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1495)
  AUTHORS   Casida,L.Earl.Jr., Falkinham,J.Oliver.III and Cain,C.Christopher.
  TITLE     Non-obligate predatory bacterium burkholderia casidae and uses
            thereof
  JOURNAL   Patent: US 6319497-A 1 20-NOV-2001;
FEATURES             Location/Qualifiers
     source          1..1495
                     /organism="unknown"
                     /mol_type="unassigned DNA"
ORIGIN

  Query Match             100.0%;  Score 1495;  DB 2;  Length 1495;
  Best Local Similarity   100.0%;  Pred. No. 1.4;
  Matches 1495;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;




Qy      1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60

Qy     61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120

Qy    121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180

Qy    181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240

Qy    241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300

Qy    301 ACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 ACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360

Qy    361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420

Qy    421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480

Qy    481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540

Qy    541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600

Qy    601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660

Qy    661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720

Qy    721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780

Qy    781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840

Qy    841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900

Qy    901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960

Qy    961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020

Qy   1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080

Qy   1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140

Qy   1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200

Qy   1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260

Qy   1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320

Qy   1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380

Qy   1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440

Qy   1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495



RESULT 2 
AR473471
LOCUS       AR473471                1495 bp    DNA     linear   PAT 20-FEB-2004
DEFINITION  Sequence 1 from patent US 6689357.
ACCESSION   AR473471
VERSION     AR473471.1  GI:42708950
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1495)
  AUTHORS   Casida,L.E. Jr., Falkinham,J.O. III and Cain,C.C.
  TITLE     Non-obligate predatory bacterium Burkholderia casidae and uses
            thereof
  JOURNAL   Patent: US 6689357-A 1 10-FEB-2004;
            Virginia Tech Intellectual Properties, Inc. and The Penn State
            Research Foundation; Blacksburg, VA
FEATURES             Location/Qualifiers
     source          1..1495
                     /organism="unknown"
                     /mol_type="genomic DNA"
ORIGIN

  Query Match             100.0%;  Score 1495;  DB 2;  Length 1495;
  Best Local Similarity   100.0%;  Pred. No. 1.4;
  Matches 1495;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;




Qy      1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60

Qy     61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120

Qy    121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180

Qy    181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240

Qy    241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300



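Qy    301 ACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 ACGGCCCAGACTCTTACGGGAGGCAGCAGTGGGGAATTTTGGACAATGGGCGAAAGCCTG 360

Qy    361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420

Qy    421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AGAAATCCTTGGTTCTAATATAGCCGGGGGATGACGGTACCGGAAGAATAAGCACCGGCT 480

Qy    481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540
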
Qy    541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600

Qy    601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660

Qy    661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720

Qy    721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780

Qy    781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840

Qy    841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900

Qy    901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960

Qy    961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020

Qy   1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080

Qy   1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140

Qy   1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200

Qy   1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260

Qy   1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320

Qy   1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380

Qy   1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440

Qy   1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495



RESULT 3 
AF226727 
LOCUS       AF226727                1495 bp    DNA     linear   BCT 13-SEP-2000
DEFINITION  Burkholderia sp. casida 16S ribosomal RNA gene, partial sequence.
ACCESSION   AF226727
VERSION     AF226727.1  GI:7107388
KEYWORDS
SOURCE      Burkholderia sp. casida
  ORGANISM  Burkholderia sp. casida
            Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales;
            Burkholderiaceae; Burkholderia.
REFERENCE   1  (bases 1 to 1495)
  AUTHORS   Cain,C.C., Henry,A.T., Waldo,R.H. III, Casida,L.J. Jr. and
            Falkinham,J.O. III.
  TITLE     Identification and characteristics of a novel Burkholderia strain
            with broad-spectrum antimicrobial activity
  JOURNAL   Appl. Environ. Microbiol. 66 (9), 4139-4141 (2000)
   PUBMED   10966443
REFERENCE   2  (bases 1 to 1495)
  AUTHORS   Cain,C.C. and Falkinham,J.O. III.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-JAN-2000) Biology, Virginia Tech, Fralin
            Biotechnology Center, Blacksburg, VA 24061-0346, USA
FEATURES             Location/Qualifiers
     source          1..1495
                     /organism="Burkholderia sp. casida"
                     /mol_type="genomic DNA"
                     /strain="casida"
                     /db_xref="taxon:115546"
     rRNA            <1..>1495
                     /product="16S ribosomal RNA"
ORIGIN



Query Match 100.0%; Score 1495; DB 15; Length 1495; 

Best Local Similarity 100.0%; Pred. No. 1.4; 

Matches 1495; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 AAATATTACGCTGGTTGCATGCCTTACAGCATGCAAGTCGAACGGCAGCACGGGTGCTTG 60

Qy   61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 CACCTGGTGGCGAGTGGCGAACGGGTGAGTAATACATCGGAACAATGTCCTGTAGTGGGG 120

Qy  121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 GATAGCCCGGCGAAAGCCGGATTAATACCGCATACGATCTACGGATGAAAGCGGGGGACC 180

Qy  181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TTCGGGCCTCGCGCTATAGGGTTGGCCGATGGCTGATTAGCTAGTTGGTGGGGTAAAGGC 240

Qy  241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CTACCAAGGCGACGATCAGTAGTTGTCTGAGAGGACGACCAGCCACACTGGGACTGAGAC 300
Qy  301
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301

Qy  361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTTGTCCGGAA 420

Qy  421
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421

Qy  481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 AACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAATCGGAATTACTGGG 540

Qy  541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CGTAAAGCGTGCGCAGGCGGTTTGCTAAGACCGATGTGAAATCCCCGGGCTCAACCTGGG 600

Qy  601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 AACTGCATTGGTGACTGGCAGGCTAGAGTATGGCAGAGGGGGGTAGAATTCCACGTGTAG 660

Qy  661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAGTGAAATGCGTAGAGATGTGGAAGAATACCGATGGCGAAGGCAGCCCCCTGGGCCAAT 720

Qy  721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 ACTGACGCTCATGCACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCAC 780

Qy  781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GCCCTAAACGATGTCAACTAGTTGTTGGGGATTCATTTCCTTAGTAACGTAGCTAACGCG 840

Qy  841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TGAAGTTGACCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGA 900

Qy  901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 CCCGCACAAGCGGTGGATGATGTGGATTAATTCGATGCAACGCGAAAAACCTTACCTACC 960

Qy  961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 CTTGACATGGTCGGAATCCCGCTGAGAGGTGGGAGTGCTCGAAAGAGAACCGGCGCACAG 1020

Qy 1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 GTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGC 1080

Qy 1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 GCAACCCTTGTCCTTAGTTGCTACGCAAGAGCACTCTAAGGAGACTGCCGGTGACAAACC 1140

Qy 1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 GGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTATGGGTAGGGCTCACACGTCA 1200

Qy 1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TACAATGGTCGGAACAGAGGGTTGCCACCCGCGAAGGGGAGCTAATCCCAGAAAACCGAT 1260

Qy 1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CGTAGTCCGGATTGCACTCTGCACCTCGAGTGCATGAAGCTGGAATCGCTAGTAATCGCG 1320

Qy 1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GATCAGCATGCCGCGGTGAATACTTTCCCGGGTTTTGTACACACCGCCCGTCACACCATG 1380

Qy 1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GGAGTGGGTTTTACCAGAAGTGGCTAGTCTAACCGCAAGGAAGAACGGTCCCCACGGTAG 1440

Qy 1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GATTCATGACTGGGTGAAGTCGTAACAAGTAGCCGTATCCGAAAGTTCGGCTGGA 1495



RESULT 4 
BSP491304
LOCUS       BSP491304               1523 bp    DNA     linear   BCT 18-JUN-2003
DEFINITION  Burkholderia sp. CEB01056 16S rRNA gene, strain CEB01056.
ACCESSION   AJ491304
VERSION     AJ491304.1  GI:31873152
KEYWORDS    16S ribosomal RNA; 16S rRNA gene.
SOURCE      Burkholderia sp. CEB01056
  ORGANISM  Burkholderia sp. CEB01056
            Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales;
            Burkholderiaceae; Burkholderia.
REFERENCE   1
  AUTHORS   Ramisse,V., Balandreau,J., Thibault,F., Vidal,D., Vergnaud,G. and
            Normand,P.
  TITLE     DNA-DNA hybridization study of Burkholderia species using genomic
            DNA macro-array analysis coupled to reverse genome probing
  JOURNAL   Int. J. Syst. Evol. Microbiol. 53 (Pt 3), 739-746 (2003)
   PUBMED   12807195
REFERENCE   2  (bases 1 to 1523)
  AUTHORS   Ramisse,V.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUN-2002) Ramisse V., Microbiologie, Centre d'Etudes
            du Bouchet, BP3, 91710 Vert le Petit, FRANCE
FEATURES             Location/Qualifiers
     source          1..1523
                     /organism="Burkholderia sp. CEB01056"
                     /mol_type="genomic DNA"
                     /strain="CEB01056"
                     /db_xref="taxon:198809"
     gene            1..1523
                     /gene="16S rRNA"
     rRNA            1..1523
                     /gene="16S rRNA"
                     /product="16S ribosomal RNA"
ORIGIN



Query Match 92.0%; Score 1375.2; DB 15; Length 1523;

Best Local Similarity 98.3%; Pred. No. 5.9;

Matches 1474; Conservative 0; Mismatches 18; Indels 8; Gaps 8;
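The two percentages printed for RESULT 4 appear to follow directly from the raw counts. The sketch below is a minimal check in Python, assuming that Query Match is the score divided by the perfect score and that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels). Neither formula is stated in the report and the function names are illustrative only.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all aligned columns (assumed formula)."""
    return 100.0 * matches / (matches + mismatches + indels)


# Figures printed for RESULT 4; the perfect score 1495 comes from the report header.
print(round(query_match_pct(1375.2, 1495), 1))
print(round(best_local_similarity_pct(1474, 18, 8), 1))
```

Under these assumptions the two values round to 92.0 and 98.3, which agrees with the figures in the report.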
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SCORE Search Results Details for Application 
10776767 and Search Result 
20070112_102010_us-10-776-767-1.rst.


GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd.



OM nucleic - nucleic search, using sw model 

Run on: January 13, 2007, 14:51:42 ; Search time 7704 Seconds 

(without alignments) 
10851.428 Million cell updates/sec 

Title: US-10-776-767-1

Perfect score: 1495 

Sequence: 1 AAATATTACGCTGGTTGCAT TATCCGAAAGTTCGGCTGGA 1495 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 48236798 seqs, 27959665780 residues

Total number of hits satisfying chosen parameters: 96473596

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      EST:*
                1:  gb_est1:*
                2:  gb_est3:*
                3:  gb_est4:*
                4:  gb_est5:*
                5:  gb_est6:*
                6:  gb_htc:*
                7:  gb_est2:*
                8:  gb_est7:*
                9:  gb_est8:*
                10: gb_est9:*
                11: gb_gss1:*
                12: gb_gss2:*
                13: gb_gss3:*
                14: gb_gss4:*
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
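The "Million cell updates/sec" figure in the header of this search can be reproduced from the other header values. The sketch below is a hedged reconstruction in Python, assuming one cell update per query base per database residue and a factor of 2 for searching both strands. The strand factor is an inference, not something the report states, and the function name is illustrative.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Dynamic-programming cells filled per second, in millions.

    strands=2 assumes the query is compared against both strands of every
    database sequence; this factor is inferred, not stated in the report.
    """
    return strands * query_len * db_residues / seconds / 1e6


# Header values of the EST search: 1495-base query, 27959665780 residues, 7704 s.
print(round(million_cell_updates_per_sec(1495, 27959665780, 7704), 3))
```

With these inputs the result is 10851.428, the value printed in the header. The both-strands reading is also consistent with the hit count, since 2 x 48236798 = 96473596.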
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ALIGNMENTS 

RESULT 1 
BH771024 



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10776767 and Search Result 
20070112_102013_us-10-776-767-1.rni.
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GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         January 13, 2007, 15:15:32 ; Search time 273 Seconds
                (without alignments)
                10246.548 Million cell updates/sec

Title:          US-10-776-767-1
Perfect score:  1495
Sequence:       1 AAATATTACGCTGGTTGCAT TATCCGAAAGTTCGGCTGGA 1495

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       1403666 seqs, 935554401 residues

Total number of hits satisfying chosen parameters: 2807332

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      Issued_Patents_NA:*
                1:  /EMC_Celerra_SIDS3/ptodata/2/ina/1_COMB.seq:*
                2:  /EMC_Celerra_SIDS3/ptodata/2/ina/5_COMB.seq:*
                3:  /EMC_Celerra_SIDS3/ptodata/2/ina/6A_COMB.seq:*
                4:  /EMC_Celerra_SIDS3/ptodata/2/ina/6B_COMB.seq:*
                5:  /EMC_Celerra_SIDS3/ptodata/2/ina/7_COMB.seq:*
                6:  /EMC_Celerra_SIDS3/ptodata/2/ina/H_COMB.seq:*
                7:  /EMC_Celerra_SIDS3/ptodata/2/ina/PCTUS_COMB.seq:*
                8:  /EMC_Celerra_SIDS3/ptodata/2/ina/PP_COMB.seq:*
                9:  /EMC_Celerra_SIDS3/ptodata/2/ina/RE_COMB.seq:*
                10: /EMC_Celerra_SIDS3/ptodata/2/ina/backfiles1.seq:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
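The header above names a Smith-Waterman ("sw") model with Gapop 10.0 and Gapext 1.0. As a rough illustration of how such a score can be computed, the sketch below implements local alignment with affine gaps (Gotoh-style recurrences) in Python. It is not the GenCore implementation. The match and mismatch values and the gap cost of open + ext x length are assumptions: the IDENTITY_NUC table itself is not shown in the report. They were chosen because 1474 x 1 - 18 x 0.6 - 8 x (10 + 1) = 1375.2, which happens to equal the RESULT 4 score of the GenBank search (1474 matches, 18 mismatches, 8 single-base gaps).

```python
def sw_affine(query: str, target: str, match: float = 1.0, mismatch: float = -0.6,
              gap_open: float = 10.0, gap_ext: float = 1.0) -> float:
    """Best local alignment score with affine gap penalties.

    A gap of length k is charged gap_open + k * gap_ext (assumed convention).
    Uses O(len(query) * len(target)) memory, so it suits short sequences only.
    """
    neg = float("-inf")
    n, m = len(query), len(target)
    # H: best score of an alignment ending at (i, j); 0 means "start fresh here".
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    # E: best score ending with a gap that consumes target[j-1].
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    # F: best score ending with a gap that consumes query[i-1].
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(E[i][j - 1] - gap_ext, H[i][j - 1] - gap_open - gap_ext)
            F[i][j] = max(F[i - 1][j] - gap_ext, H[i - 1][j] - gap_open - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            H[i][j] = max(0.0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


# The first 20 bases of the query sequence, aligned against themselves.
print(sw_affine("AAATATTACGCTGGTTGCAT", "AAATATTACGCTGGTTGCAT"))
```

Under this scoring a sequence aligned to itself scores its own length, which is consistent with the perfect score of 1495 reported for the 1495-base query.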
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ALIGNMENTS 



RESULT 1 
US-09-063-898-1 

Sequence 1, Application US/09063898 
Patent No. 6319497 
GENERAL INFORMATION: 

APPLICANT: Casida, Jr., Lester E, 






SCORE Search Results Details for Application
10776767 and Search Result
20070112_102021_us-10-776-767-1.rnpbm.
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GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         January 13, 2007, 15:27:11 ; Search time 2057 Seconds
                (without alignments)
                8930.488 Million cell updates/sec

Title:          US-10-776-767-1
Perfect score:  1495
Sequence:       1 AAATATTACGCTGGTTGCAT ... TATCCGAAAGTTCGGCTGGA 1495

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       18892170 seqs, 6143817638 residues

Total number of hits satisfying chosen parameters: 37784340

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      Published_Applications_NA_Main:*
                1:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                3:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                4:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                5:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                6:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                7:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                8:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
                9:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
                10: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
                11: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10F_PUBCOMB.seq:*
                12: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB.seq:*
                13: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11A_PUBCOMB.seq:*
                14: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11B_PUBCOMB.seq:*
                15: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11C_PUBCOMB.seq:*
                16: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11D_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                %
Result        Query
 No.   Score  Match      Length          DB  ID                 Description
   1    1495  100.0        1495 [illegible]  [illegible]        Sequence 1, Appli
   2    1495  100.0        1495 [illegible]  US-10-776-767-1    Sequence 1, Appli
   3  1348.8   90.2 [illegible]           7  [illegible]        Sequence 41, Appl
   4  1312.6   87.8        1488           7  [illegible]        Sequence 42, Appl
   5    1303   87.2        1610           7  [illegible]        Sequence 43, Appl
   6  1268.4   84.8        1522          16  US-11-198-242-2    Sequence 2, Appli
   7  1192.6   79.8 [illegible]           7  [illegible]        Sequence 1, Appli
   8  1148.4   76.8 [illegible]           7  [illegible]        Sequence 38, Appl
   9  1135.8   76.0        1485           7  [illegible]        Sequence 39, Appl
  10  1134.6   75.9 [illegible] [illegible]  [illegible]        Sequence 32, Appl
  11  1111.4   74.3        1464           7  [illegible]        Sequence 40, Appl
  12  1107.2   74.1        1481           7  [illegible]        Sequence 1, Appli
  13  1107.2   74.1        1481 [illegible]  [illegible]        Sequence 1, Appli
  14  1104.6   73.9 [illegible]           7  [illegible]        Sequence 4, Appli
  15  1104.6   73.9 [illegible]           7  [illegible]        Sequence 4, Appli
  16  1104.6   73.9 [illegible]           7  [illegible]        Sequence 4, Appli
  17    1099   73.5 [illegible] [illegible]  [illegible]        Sequence 1, Appli
  18  1098.6   73.5        1485           7  [illegible]        Sequence 10, Appl
  19    1095   73.2 [illegible]          16  [illegible]        Sequence 1, Appli
  20  1090.8   73.0        1460           7  [illegible]        Sequence 6, Appli
  21  1089.4   72.9 [illegible]           7  [illegible]        Sequence 6, Appli
  22  1089.4   72.9        1539           7  [illegible]        Sequence 6, Appli
  23  1089.4   72.9 [illegible]           7  [illegible]        Sequence 6, Appli
  24  1083.8   72.5        1467           7  [illegible]        Sequence 2, Appli
  25  1083.8   72.5        1467           7  [illegible]        Sequence 2, Appli
  26  1083.8   72.5        1467           7  [illegible]        Sequence 2, Appli
  27  1083.2   72.5        1454           7  [illegible]        Sequence 7, Appli
  28  1083.2   72.5        1454           7  [illegible]        Sequence 7, Appli
  29  1083.2   72.5        1454           7  [illegible]        Sequence 7, Appli
  30  1082.4   72.4        1460           7  [illegible]        Sequence 4, Appli
  31  1080.4   72.3        1458           8  US-10-659-948A-3   Sequence 3, Appli
  32  1080.4   72.3        1458           8  US-10-659-980A-3   Sequence 3, Appli
  33  1080.4   72.3        1458           8  US-10-659-983A-3   Sequence 3, Appli
  34  1078.4   72.1        1459           7  US-10-168-337A-8   Sequence 8, Appli
  35    1078   72.1        1478           7  US-10-168-337A-3   Sequence 3, Appli
  36  1077.8   72.1        1491           8  US-10-659-948A-20  Sequence 20, Appl
  37  1077.8   72.1        1491           8  US-10-659-980A-20  Sequence 20, Appl
  38  1077.8   72.1        1491           8  US-10-659-983A-20  Sequence 20, Appl
  39  1076.6   72.0        1405          11  US-10-515-311-5    Sequence 5, Appli
  40  1075.4   71.9        1426           7  US-10-168-337A-9   Sequence 9, Appli
  41  1072.8   71.8        1460           7  US-10-168-337A-2   Sequence 2, Appli
  42  1071.8   71.7        1460           7  US-10-168-337A-1   Sequence 1, Appli
  43  1060.8   71.0        1457           8  US-10-659-948A-1   Sequence 1, Appli
  44  1060.8   71.0        1457           8  US-10-659-980A-1   Sequence 1, Appli
  45  1060.8   71.0        1457           8  US-10-659-983A-1   Sequence 1, Appli



ALIGNMENTS 
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